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Human  Plausible  Reasoning 
Executive  Summary 


During  the  second  year  of  the  contract  our  work  centered  in  four  areas: 


l.We  completed  a  computer  model  embodying  the  theory  of  plausible  reasoning 
developed  in  the  paper  by  Collins  and  Michalski  entitled  "The  Logic  of  Plausible 
Reasoning:  A  Core  Theory"  to  be  published  in  Cognitive  Science.  The  simulation 
model  was  developed  by  Michelle  Baker  and  Mark  Burstein,  and  is  described  in 
detail  in  the  rest  of  this  report. 


2.  We  wrote  a  paper  describing  the  simulation  model  entitled  "Implementing  a 
Theory  of  Human  Plausible  Reasoning"  by  Michelle  Baker,  Mark  Burstein,  and 
Allan  Collins,  which  was  presented  at  IJCAI  tn  Milan  Italy,  and  appears  in  the 
Conference  Proceedings  of  IJCAI- 10,  1987.  This  paper  constitutes  the  bulk  of  this 
report. 

3.  We  constructed  two  small  data  bases,  one  on  grain  growing  (shown  in  Table  l 
below)  and  one  in  economics.  These  were  implemented  in  the  system  in  order  to 
test  out  what  plausible  inferences  the  system  draws  given  incomplete  information 
about  a  given  domain.  In  addition  to  the  kind  of  data  shown  in  Table  1,  various 
mutual  dependencies  (e.g.  precipitation  A  irrigation  <->  water  supply)  were  also 
included  in  the  data  base  in  order  to  constrain  the  plausible  inferences  drawn. 


4. 


We  ran  four  expert  reasoners  with  little  knowledge  of  geography  in  an  experiment 
using  the  grain  growing  data  base  shown  in  Table  1  below.  Subjects  were  asked  to 
specify  first  what  mutual  dependencies  between  the  variables  shown  they  knew 
about  a  priori.  Then  they  were  asked  to  try  to  guess  the  values  of  the  unspecified 
variables  and  to  explain  the  basis  of  their  reasonuig.  Their  plausible  uiferences 
will  be  directly  compared  to  the  plausible  inferences  made  by  the  computer  model 
over  the  same  data,  and  where  there  are  systematic  differences  the  computer  model 
will  be  refined  accordingly.  . . — - — - — 
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1.  INTRODUCTION 

Over  the  last  15  years,  Collins  and  his  colleagues  (Carbonell  and  Collins,  1973,  Collins  et  al..  1975, 
Collins, 1978a,  Collins,! 978b)have  collected  and  categorized  a  wide  variety  of  .human  plausible  inferences 
made  from  incomplete  and  inconsistent  information.  This  work  led  to  the  development  of  a  partial  theory 
of  plausible  inference  (Collins  and  Michalski,  in  press)  for  situations  where  the  most  appropriate  or 
specific  information  was  not  available.  This  paper  describes  some  current  work  in  progress,  the 
development  of  a  computer  simulation  of  a  portion  of  that  theory.  Our  goal  is  to  use  the  simulation  as  a 
means  of  testing  and  refining  the  theory. 

The  popularity  of  expert  systems  has  generated  great  interest  in  developing  techniques  to  reason  with 
uncertain  information.  To  date,  research  on  reasoning  under  uncertainty  has  emphasized  the  role  of 
statistical  theory.  (Pearl,  1986,  Duda  et  al.,  1976).  Unfortunately,  in  most  real-world  problems  neither  the 
data  nor  the  inference  rules  themselves  are  known  to  apply  with  precise  certainties.  Methods  of 
combining  uncertain  evidence  from  multiple  sources  are  also  often  required.  With  the  exception  of  Cohen 
(Cohen,  1985),  it  has  usually  been  assumed  that  the  appropriate  certainty  parameters  and  the  methods  of 
combination  were  independent  of  the  type  of  inference  performed.  Furthermore,  these  techniques  usually 
require  some  form  of  closed  world  assumption  for  correct  interpretation.  Unfortunately,  in  most  real-world 
situations,  the  available  information  is  incomplete  as  well  as  uncertain.  People  deal  with  this  problem 
continually,  and  quite  effectively,  using  techniques  for  reasoning  by  similarity,  reasoning  from  negative 
information,  and  reasoning  from  their  own  tack  of  knowledge  about  particulars  (e.g.,  *1  would  know  it  if 
Ronald  Reagan  was  10  feet  tall.*)  It  is  these  kinds  of  inferences  that  we  seek  to  model. 

Collins’  theory  of  plausible  reasoning  is  based  on  a  corpus  of  people’s  answers  to  everyday  questions 
(Collins, 1 976b).  In  general,  he  found  that  these  answers  had  the  following  characteristics: 

1.  There  are  usually  several  different  inference  types  used  to  answer  any  question. 

2.  The  same  inference  types  recur  in  many  different  answers. 

3.  People  weigh  different  evidence  (and  different  kinds  of  evidence)  they  find  that  bears  on  a 
question. 

4.  People  are  more  or  less  certain  depending  on  the  certainty  of  their  information,  the  certainty 
of  the  inferences  used,  and  on  whether  different  inferences  lead  to  the  same  or  opposite 
conclusions. 

Also  apparent  from  the  protocols  is  that  subjects  faced  with  answering  a  question  for  which  they  have 
no  specific  knowledge  launch  a  search  for  relevant  Information  that  they  do  have.  As  relevant  pieces  of 
information  are  found  (or  are  found  to  be  missing),  they  trigger  particular  types  of  inferences.  The  type  of 
inference  applied  is  determined  by  the  relation  between  the  information  found  and  the  question  asked 
For  example,  when  a  tutor  was  asked  whether  they  grow  coffee  in  the  Llanos  region  of  Colombo,  he 
responded: 

I  don't  think  that  tha  savanna  is  uaad  for  growing  coha*.  Tha  troubi*  *  tha  savanna  ha*  a  rainy  saaaen 
and  you  can't  count  on  rain  in  general.  But  I  don't  know,  thia  area  around  Sso  Paulo  (in  Brazil)  is  cofta# 
region,  and  it  is  sort  of  getting  into  the  savanna  region  thar*. 

Initially,  the  tutor  said  no  because  he  knew  that  coffee  growing  depends  on  factors  like  ranfaii. 
temperature,  soil.  etc.  and  the  savannas  do  not  have  the  correct  value  on  the  rainfall  factor.  (This  is  caned 
a  derivation  from  mutual  implication  in  the  theory.)  Secondly,  he  did  not  know  specifically  that  the  Llanos 
was  used  for  coffee  growing,  and  believed  that  he  would  know  if  it  was  (lack  of  knowledge).  Later,  he 
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backed  off  when  he  found  positive  evidence;  i.e.,  that  the  region  in  Brazil  was  near  an  area  where  coffee 
was  grown  (a  similarity  transform).  His  final  answer  weighed  ail  of  these  pieces  of  evidence  together, 
albeit  inexactly. 

In  the  remainder  o;;  this  paper,  we  will  describe  an  initial  implementation  of  one  part  of  Collins'  theory  of 
plausible  reasoning,  based  on  examples  like  this  one.  Initially,  we  have  concentrated  on  modeling  the 
class  of  functional  inferences,  where  the  inference  is  based  on  a  functional  dependence  such  as  that 
coffee  growing  depends  on  climate  and  vegetation. 

The  primary  purpose  of  the  system  is  to  act  as  a  testbed  for  the  theory.  As  such,  it  is  not  designed  to 
produoe  one  'right*  answer,  but  a  number  of  plausible  positive  and  negative  Inferences  each  of  which 
may  be  a  weak  (or  not  so  weak)  reason  for  believing  that  the  question  asked  could  be  answered  in  a 
particular  way.  Our  goals  are  to  demonstrate  that  the  theory  produces  only  plausible  answers,  to  develop 
ways  of  searching  memory  for  the  kinds  of  relevant  information  that  are  needed  in  order  to  apply  each 
inference  typ-\  and  to  investigate  methods  for  combining  the  various  kinds  of  evidence  that  are  produced. 

The  Plausible  Reasoning  Simulation  System  (PRSS)  we  have  developed  is  thus  quite  different  from 
other  systems  that  have  been  developed  to  reason  with  incomplete  and/or  uncertain  information.  Since  it 
is  intended  to  simulate  human  reasoning,  it  generates  multiple  proofs  of  both  the  truth  and  the  falsity  o’  a 
given  proposition.  The  types  of  inferences  made  depend  on  the  particular  information  found  in  memory, 
anc.  the  nature  of  their  relevance  to  the  question  asked.  Finally,  the  certainty  of  the  overall  conclusions 
reached  depends  on  both  the  certainty  of  the  evidence  and  the  types  of  inferences  used. 

2.  AN  EXAMPLE 

To  give  a  sense  of  the  behavior  of  the  simulation  system,  consider  how  It  behaves  when  asked  a 
question  like  ’Does  coffee  grow  in  Llanos?’. 


(?  crop  :of  llano*  :s  cotfaa) 

ITO  DIRECT  EVIDENCE  TOUND. 

TRYING  NEGATIVE  IMPLICATION  IVON: 

CROP  •  COFFEE  — >  SAINT  ALL  •  SIGN  (eartainty  .*) 

Sine*  RIGS  i«  not  a  known  value  Cor  RAINFALL (LLANOS) . 

and  set  of  value*  Cos  RAINFALL (LLANOS)  ia  CLOSED. 

Conclude  tha*  COSTEX  i*  not  »  value  tor  CROP  (LIANGS) 
with  KDIOM  certainty. 

TRYING  ARGOKSMT  RASED  DEPENDENCY  TP i MSTORMS . . 

LLANOS  and  SAO-PAOLO  match  on  CL  'KATE .  (sin  -  O.i) 

LLANOS  and  SAQ-PAOLO  match  on  VEGETATION.  (*ia  •  0.6) 

Gain <3  a  SIM  transform: 

Since  CLIMATE  and  VEGETATION  <— :  CRON 

and  SAD- PAOLO  ia  sieilar  to  LLAKQS  »ith  respect  to  CLIMATE 
end  VEGETATION.  (ale  ■  0.7) 
end  CROP (BAO-PAOLO)  -  COFFEE 

Conclude  that  CROP  (LLANOS)  -  COFFEE  is  TRUE  with  MEDIUM  certainty. 


Evidence  ia  evenly  mixed .  I  cannot  ealco  a  judgement . 
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For  this  example,  PRSS  finds  two  kinds  of  evidence.  First,  it  reasons  from  the  implication  that  coffee 
growing  requires  heavy  rainfall,  and  from  the  fact  that  it  does  not  believe'the  Llanos  to  have  heavy 
rainfall,  that  the  Llanos  is  not  a  coffee  growing  region.  This  conclusion  is  given  medium  certainty  primarily 
because  of  the  certainty  of  the  implication.  Secondly,  it  finds  that  the  SAO-PAULO  region  does  have 
coffee  as  a  crop  and  matches  Llanos  on  CLIMATE  and  VEGETATION,  two  variables  involved  in  a  mutual 
dependency  with  CROP.  Since  the  evidence  is  eveniy  divided,  no  final  conclusion  is  reached. 


3.  SYSTEM  OVERVIEW 

Unlike  an  expert  system,  which  must  generate  a  solution,  PRSS  tries  both  to  verify  and  disconfirm  each 
proposition  that  it  is  given  as  an  input  question.  Some  examples  of  the  kinds  of  queries  the  system  may 
receive  as  input  are: 

(?  CLIMATE  : OF  ENGLAND  :=  TEMPERATE) 

(?  FLOWER-TYPE  :OF  HOLLAND  :*  ROSE) 

(?  WATER-REQUIREMENT  :OF  ROSE  :»  HIGH) . 

The  system  responds  to  each  query  with  a  determination  of  whether  the  statement  is  TRUE  or  FALSE 
along  with  an  estimate  of  the  certainty  of  its  answer  and  an  explanation  of  its  reasoning.  When  presented 
with  a  query  the  system  first  checks  whether  it  has  the  answer  stored  directly.  If  so,  the  answer  is 
returned  along  with  the  certainty  that  was  recorded  at  the  same  time  the  tact  was  recorded.  If  it  does  not 
have  the  fact  stored  it  attempts  to  use  every  plausible  inference  for  which  it  has  adequate  information  and 
explains  what  it  is  doing  as  it  performs  each  inference.  The  evidence  from  each  plausible  inference  is 
then  weighed  together  to  generate  a  final  guess  of  TRUE  or  FALSE  along  with  the  estimated  certainty  of 
that  guess. 

In  general,  people  use  many  different,  possibly  independent,  arguments  to  convince  themselves  of  the 
truth  or  falsity  of  a  proposition.  It  is  a  bit  like  using  a  theorem  prover  that  returns  every  possible  proof 
Unlike  Bayesian  inference  networks  (Peart.  1986),  which  can  be  viewed  as  combining  probabilistic 
evidence  from  multiple  proofs  to  verify  the  truth  of  a  proposition,  our  system  tries  o  prove  both  the  truth 
and.  separately,  the  falsity  of  a  proposition  in  as  many  ways  as  are  possible  given  the  information 
available. 

Each  inference  made  by  PRSS  is  UKe  a  proof  in  that  it  may  require  backchaining  to  generate 
information  necessary  for  the  top  level  inference.  Each  top  level  inference  (i.e.  proof  based  on  uncertain 
information)  becomes  a  separate  bit  of  evidence.  Proofs  that  the  query  proposition  are  true  are  gathered 
together  as  evidence  for  the  proposition  and  proofs  of  falsity  are  pooled  as  evidence  against  the 
proposition.  Each  bit  of  evidence  has  a  certainty  parameter  that  has  been  derived  by  combining  the 
certainty  parameters  of  the  stored  propositions  used  and  parameters  that  measure  the  goodness  of 
matches  required  in  the  applications  of  inference  rules.  The  final  judgment  and  the  system's  certainty  of 
that  judgment  depend  on  the  certainties  of  the  evidence  arid  on  how  contradictory  the  evidence  was 
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4.  THE  KNOWLEDGE  BASE 

We  have  tried  to  model  the  system  on  the  behavior  of  people  when  generating  functional  inferences. 
This  has  required  a  highly  redundant,  crossreferenced  memory  organization.  The  knowledge 
representation  system  we  developed  for  this  purpose  provides  mechanisms  for  automatic  cross  reference 
of  every  input  proposition,  allowing  for  redundancies  in  set/subset  relations,  and  multiple  indexing  of 
declarative  inference  rules.  Collins  and  Mlchalski’s  theory  assumes  that  inferences  are  made  when 
relevant  information  is  found  by  a  parallel  search  for  information  associated  with  the  argument  and  the 
referent  of  the  query.  While  our  current  simulation  does  not  do  this  directly,  we  have  implemented  a  set 
of  specialized  search  routines  that  collect  all  information  potentially  useful  for  (possibly  several  of)  the 
inference  types  so  far  implemented. 

PRSS  has  a  database  consisting  of  propositional  knowledge  and  functional  relations  (implications  and 
mutual  dependencies),  organized  in  a  multiply-indexed  semantic  network.  In  the  existing  implementation 
each  proposition  is  a  binary  relation.  We  are  currently  working  on  extending  the  representation  to  include 
structured  objects  and  n-ary  relations. 

Collins  and  Michalski  (in  press)  identified  four  different  certainty  parameters  associated  with  the 
propositions  or  declarative  knowledge  in  this  network.  Two  parameters,  certainty  and  frequency  are 
associated  with  each  proposition  in  the  knowledge  base.  For  example,  we  might  have 

CLIMATE  (AFRICA)  »  TEMPERATE,  frequency  *  .3,  certainty  •  .9 

CLIMATE  (AFRICA)  ■  TROPICAL.  frequency  »  .5,  certainty  «  HIGH  . 


Following  the  notation  of  Collins  and  Michalfki  (in  press),  we  call  the  predicate  a  descriptor,  which, 
together  with  its  argument  (here.  AFRICA)  forms  a  term.  The  predicate  CLIMATE  is  the  descriptor, 
mapping  its  argument  (a  place)  to  various  referents  (values  for  climates).  The  certainty  parameter  is  a 
measure  of  degree  of  certainty  that  a  statement  is  believed  to  be  true.  The  frequency  parameter2 
measures  the  estimated  proportion  of  the  referent  out  of  all  possible  referents  for  that  descnptor  and 
argument.  The  example  above  represents  the  belief  that  30%  of  AFRICA  is  temperate  and  50%  is 
tropical.3 

In  addition  to  certainty,  a  likelihood  parameter  is  attached  to  each  implication  and  dependency  For 
example  we  might  have  the  dependency. 

Foe  all  Pisces  p. 

TEMPERATURE  (p)  <*—>  LATTITUDS(p) 

certainty  ■  .9  .likelihood  -  HIGH. 

where  the  likelihood  is  intended  to  be  a  measure  of  the  conditional  probability  of  the  right-hand  side 
given  the  left  hand  side.  For  an  implication  like  the  one  below,  it  is  a  measure  of  the  kkelthood  that  the 
right  hand  side  of  the  impfccaiion  is  in  the  given  range  when  the  left  hand  side  is  in  its  specified  range 

For  all  Places  p, 

GRAIN (p)  «  rica  **»>  rainfall (p)  »  haavy 


sCoftMpo<*5ng  to  tha  iltom  OaUv&ca  in  loge 

sAi  (***•>*£  wa  ftuuma  that  potaotki  kVtgutiM  utocatad  w<h  tha  maaiung  of  tha  tnaguancy  pinnafar  •  a  g  cioat  *  ’a**'  to 
aptoa  of  ima  u*  accounted  lex  by  ccntiXM  tftwpntiAon  by  tha  uaar 
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certainty  *  .9,  likelihood  *  HIGH. 

The  fourth  type  of  certainty  parameter  stored  with  the  declarative  knowledge  of  the  system  is 
dominance.  A  dominance  parameter  is  associated  with  every  set/subset  link  in  the  system.  It  measures 
the  proportion  of  elements  in  the  subset  out  of  all  elements  in  the  set.  For  example.  PART* 
OF(ENGLAND)  »  SURREY  would  have  low  dominance,  since  Surrey  is  a  small  part  of  England. 


5.  MULTIPLE  TYPES  OF  INFERENCE 

The  current  version  of  PRSS  implements  three  basic  types  of  functional  inferences  on  statements 
retreived  from  its  memory,  depending  on  the  kind  of  dependency  found  and  the  resulting  kind  of 
contextually-based  similarity  match  required.  The  three  types  are  functional  analogies ,  whLh  are  based 
mutual  dependencies  between  descriptors,  implication  inferences,  and  set/subset  inferences. 

In  the  example  below,  we  show  how  the  system  is  able  to  construct  throe  separate  'proofs*  that  the 
climate  of  England  is  temperate.  Given  the  data  in  memory  provided  for  this  example,  the  system  is 
unable  to  construct  a  single  proof  that  the  climate  of  England  is  not  temperate. 


(?  climate  :of  england  :=  temperata) 

Uaing  an  Inheritance  transform: 

Since  ENGLAND  ■  PART -OF (EUROPE)  (dom  >  LOW) 

And  EUROPE  has  CLIMATE  ■  TEMPERATE  (certainty  ■  HIGH) 

Conclude  that  CLIMATE  (ENGLAND)  -  TEMPERATE  i  a  TRUE  with  MED  certainty. 


Oaing  an  Implication  tranaform: 

Since  LATITUDE  -  SECOND-QUAD  or  THIRD-QUAD  —>  CLIMATE  •  TEMPERATE 
and  LA* .TUDE (ENGLAND)  -  THIRD-QUAD 

Conclude  that  CLIMATE  (ENGLAND)  •>  TEMPERATE  la  TRUE  with  MEDIUM  certainty. 


TRYING  ARGUMENT  BASED  DEPENDENCY  TRANSFORMS .... 

Uaing  a  SIM  tranaform  I  reaaon: 

Since  LATITUDE  <-*»«>  CLIMATE 

and  HOLLAND  ia  similar  to  ENGLAND  with  reapect  to  LATITUDE-  (aim  >1.0) 
and  CLIMATE  (HOLLAND)  -  TEMPERATE. 

Conclude  that  CLIMATE  (ENGLAND)  -  TEMPERATE  i«  TRUE  with  MEDIUM  certainty. 


TRYING  REFERENT  EASED  DEPENDENCY  TRANSFORMS . 

Insufficient  Information  Available. 

I  conclude  CLIMATE  (ENGLAND)  -  TEMPERATE,  (certainty  -  HIGH)  . 


One  general  class  of  functional  inference  is  called  statement  transforms  (Colons  and  Michatskt  in 
press).  This  type  of  inference  requires  a  declarative  rule  called  a  dependency.  In  the  example  above,  an 
analogy  is  made  between  England  and  Ho  Hand.  The  system  is  aware  of  a  general  relationship  that  the 
cSmate  of  a  place  is  dependent  upon  the  latitude  of  a  place.  Ir  order  to  determine  whether  a  specific 
fetation  exists  between  a  lahtude  in  the  third-quad  (*5-67.5  deg.)  and  a  temperate  climate  the  system 
must  find  an  instance  analogous  to  England  which  is  known  to  have  a  temperate  climate.  Holland  is  such 
an  instance.  Since  Holland  and  England  have  the  same  latitude  the  system  can  conclude  that  England 
can  have  a  temperate  climate  as  wen 
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Argument-based  Transforms 

GEN:  flower-type  (Europe)  “{daffodils,  roses...} 
SPEC:  flower-type (Surrey) ={d*££odils,  roses...) 

SIM:  flower-type (Holland) •{daffodils,  roses...) 

DZS :  flower-type (Brazil) #{ daffodil a,  roses...) 

Reference-based  Transforms 

GEN:  flower-type (England) ■ {temperate  flowers...} 

SPEC:  flower-type (England) =( yellow-roses . . . ) 

SIM:  flower-type (England) »{ peonies . . . ) 

DIS:  flower-type (England) #( bougainvillea. .  ) 


Figure  5-1 :  Eight  Transforms  on  *ftower-type(EnglandMDaffodils.  roses...} 

Within  the  class  of  statement  transforms.  Collins  and  Michalski  (in  press)  descrioe  eight  different  kinds 
of  transforms,  four  argument-based  transforms,  and  four  reference-based  transforms.  The  eight 
inference  transforms  were  derived  by  considering  concepts  related  to  the  ones  mentioned  in  the  question 
asked,  where  the  relationship  could  be  any  of  generalization,  specialization,  similarity,  and  dissimilarity 
Each  of  these  operators  could  be  applied  to  either  the  argument  or  the  referent  in  the  question  statement, 
giving  the  total  of  eight  specific  transforms.  Figure  5-1  gives  an  example  of  each  of  the  eight  transforms 
for  the  statement  FLOWER-TYPE(ENGLAND)«{daffodils,  roses...}.  The  overall  certainty  of  an  inference 
based  on  one  of  these  transforms  depends  on  the  degree  of  similarity  or  typicality  of  the  concepts  related, 
as  compared  along  the  dimensions  specified  in  trie  dependency  used,  and  the  degree  of  certainty  of  the 
dependency  itself. 

The  dependency  used  in  the  exempts  above  can  be  described  in  the  predicate  calculus  as. 

V  pl,p2, 1, c  PLACE (pi)  a  PLACE (p2 )  a 

LATITUDE  (pi,  1)  a  LATITUDE (p2,l)  a  CLIMATE  (p2 .  c.) 

CLIMATE  (pi,  C) 

t  o  if  two  places  match  on  latitude  then  they  wii  match  on  climate. 

The  simplest  type  of  functional  inference  is  based  on  a  type  of  declarative  inference  rule  called  an 
UnpOcatlon.  implication  inferences  can  be  used  to  infer  values  for  properties  on  the  basis  of  other 
properties  o#  the  same  concept.  Since  the  precise  reUttion  is  conrpJetefy  specified  in  an  implication,  an 
analogous  instance  is  not  required  for  its  application.  The  implication  used  In  the  example  above  n  be 
expressed  using  the  predicate  calculus  as, 

V  x ,  PLACE (X)  a  LATITUDE (x, THIRD-QUAD)  — >  CLIMATE (x, TEMPEAATE) 

i.s.  if  the  latitude  of  a  plscc  d  third-quad  then  the  cfemssS  of  that  place  i*  temperate 
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In  the  next  example,  the  system  first  generates  an  argument-based  statement  transform  using  a 
dependency  whose  consequent  is  the  queried  descriptor,  FLOWER-TYPE.  It  finds  a  place  where  tulips 
are  grown  (Holland)  and  compares  that  place  to  Venezuela  on  the  antecedent  descriptor  of  the 
dependency,  CLIMATE.  Since  they  do  not  match,  it  concludes  that  tulips  don’t  grow  in  Venezuela.  The 
second  Inference  is  a  reference-based  transform.  Here,  a  dependency  is  required  whose  consequent  is 
the  inverse  of  the  query  descriptor  FLOWER-TYPE  (i.e  GROWS-IN),  since  one  needs  to  find  a  flower  that 
grows  in  Venezuela  and  which  is  similar  to  tulips  with  respect  to  the  factors  that  affect  flower  growth  in  a 
place.4 


(7  flow«r*typ«  :of  Venezuela  :=  tulip) 

TRYING  ARGUMENT  BASED  DEPENDENCY  TRANSFORMS .... 

Using  a  OZS  transform  I  rwaaon : 

Since  CLIMATE  <-— >  FLOWER-TYPE 

end  HOLLAND  1«  diaaifiilar  to  VENEZUELA  with  respect  to  CLIMATE. 

(•im  *  -1.0) 

end  FLOWER-TYPE (HOLLAND)  •  TULIP. 

Conclude  that  FLOWER-TYPE  (VENEZUELA)  -  TULIP  ia  FALSE  with  LOW  certainty. 


TRYING  REFERENT  BASED  DEPENDENCY  TRANSFORMS . 

Oaing  a  DIS  transform  I  reason: 

Since  CL  I  MATE -CF  <— «>  GROWS-IN 

end  BOUGAINVILIXA  ia  dissimilar  to  TULIP  with  respect  to  CLIMATR-QF. 

(aim  »  -1.0) 

end  GROWS-IN (BOUGAINVILLEA)  -  VENEZUELA. 

Conclude  that  GROWS- IN  (TULIP)  •  VENEZUELA  ia  FALSE  with  LOW  certainty. 
I  conclude  TULIP  IS  NOT  FLOWER-TYPE  of  VENEZUELA,  (certainty  -  MED)  . 


6.  COMPUTING  THE  CERTAINTY  OF  AN  INFERENCE 

Each  of  the  examples  shown  so  far  involves  several  types  of  inference,  and  the  certainty  of  each 
inference  is  based  on  a  combination  of  several  certainty  parameters  and  a  smvtanty  or  typicality  measure 

The  two  similarity  parameters  computed  by  the  matcher  are  similarity  and  typicality.  At  present, 
these  two  parameters  measure  the  quality  of  a  match  and  are  computed  m  exactly  the  same  way.  The 
difference  between  them  is  that  typicality  af  lies  when  a  property  (properties)  of  a  set  is  being  matched 
with  those  of  a  subset  and  similarity  is  comp  'ed  as  the  quality  of  a  match  between  two  subsets.  In  the 
theory,  similarity  (or  typicality)  measures  the  qualty  of  the  match  either  of  a  single  feature  or  of  a  bundle 
of  features. 

tn  the  current  implementation  we  compute  the  similarity  (or  rypicaHty)  of  a  single  feature  with  muispie 
known  values  by  an  urn  model  type  algorithm.5  The  similarity  parameter  is  currently  computed  as  the 


*Th4  tyrtatn  l***  •  krtawfedg*  f«prt*w»Uton  (1  which  th»  (fetch ptof  (fefirW&ar.*  m*y  tptefy  nv*i«  Tho  (fe*chptc-- 
R.OWER-TYPE  hu  b*on  dafvwd  u  htwtp  a  dont*so  ttul  mutt  b*  t  PLACE,  s  rag*  Ssi?  mutl  b*  a  ROWER.  and  *i  n vtic 

anwga-Typs  >-<-  pulses  b*5  b*&  rowers  that  grows- in  >n*p* 

ROWERS  nto  th*  PLACES  what*  th*y  gw 

Hr  Bw  Aura,  <m  p fen  to  tdMd  tfw  mtrctw  to  comp *j»  mdbfi*  feafirat  with  mutpfe  vcjUm 
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probability  that  two  values  for  a  given  feature,  chosen  at  random  within  their  frequency  distributions, 
match  or  mismatch. 

The  certainty  of  each  individual  inference  is  currently  computed  as  the  minimum  of  all  the  certainty 
parameters  and  match  certainties  used.  This  includes  the  certainties  associated  with  every  proposition 
used,  the  certainty  and  the  likelihood  of  the  inference  rule  and  similarity  measure  returned  by  the  matcher. 

Once  the  system  has  constructed  every  possible  proof  for  2  given  proposition  it  must  determine 
whether  the  proposition  is  true  or  false  and  estimate  the  certainty  of  its  guess.  Currently  this  is  done  by 
weighing  the  evidence  for  the  proposition  with  the  evidence  against  that  proposition.  The  certainties  of  all 
of  the  positive  conclusions  are  combined,  and  all  of  the  negative  conclusions  are  combined.  Multiple  line? 
of  evidence  in  a  given  direction  increases  the  certainty  of  the  conclusion  for  that  direction.  The  firu 
judgment  is  the  direction  with  the  greater  certainty,  and  the  certainty  of  that  judgement  is  cownwit  vted 
by  the  certainty  of  the  conclusion  in  the  opposite  direction. 

7.  CONCLUSION 

This  work  is  still  in  its  early  stages  and  yet  already  we  see  a  number  of  interesting  issues  that  will 
require  further  study.  To  date,  we  havts  r>ot  run  the  simulation  with  large  numbers  of  facts  in  memory,  and 
we  forsee  that  this  will  cause  the  number  of  inferences  the  system  makes  to  grow  exponentially.  Clearly, 
techniques  will  be  needed  to  control  this  growth,  such  as  the  filtering  of  weak  and  redundant  inferences, 
the  use  of  prototypes  when  many  similar  examples  exist,  and  more  sophistocated  representations  ror 
complex  dependencies  and  implications.  We  also  need  to  develop  better  and  more  efficient  techniques 
for  similarity  matching,  if  we  are  to  do  matches  on  many  contextual  features  at  once  As  the  model 
continues  to  develop,  we  will  also  begin  ?  new  round  of  protocol  expenments.  in  order  to  test  our  rrv'-Jel. 
and  answer  some  of  the  questions  discovered  by  computer  modeling. 


jniMwu<raiiviciHiuMwifwi£>iuwuuwuuuuu%H;uHuiiuvAXAKKKTCttumxiunw?ur|urKv^ 
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Table  i 

Incomplete  Data  Base  ou  Grain  Growing  In  Different  Ccuacriea 
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Mild  Winter 
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Even  Rain 


Winter  Rain 
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Plains 
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Plains 


Mountains 

Plains 
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